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Abstract 

Order the vertices of a directed random graph V\ ,...,«„; edge (v i7 Vj) for i < j exists 
independently with probability p. This random graph model is related to certain spreading 
processes on networks. We consider the component reachable from v\ and prove existence of a 
sharp threshold p* = logn/n at which this reachable component transitions from o(n) to f2(n). 



1 Introduction 

In this note we study a random graph model that captures the dynamics of a particular type of 
spreading process. Consider a set of n ordered vertices {vi, . . . ,v n } with vertex v\ initially 'infil- 
trated' (at time step 1). At time steps 2,3, ... ,n, vertex v\ attempts to independently infiltrate, 
with probability p, each of V2,Vs ... ,v n in turn (one per step). Either Vi gets infiltrated or im- 
munized. If Vi is infected, it attempts to infect . . . ,v n , also each with probability p; v% does 
not attempt to infect vi, . . . , Uj-i, however, as prior vertices are already either infiltrated or immu- 
nized. At time step i, all infiltrated vertices Vj with j < i are attempting to infiltrate Vi, and Vi 
gets infiltrated if any one of these attempts succeeds. Intuitively, Vi is more likely to get infiltrated 
if more vertices are already infiltrated at the time that V{ becomes 'succeptible'. One example of 
such a contagion process is given in [6]. 

This spreading process is equivalent to the following random model of an ordered, directed 
graph G: order the vertices t>i, . . . ,v n , and for i < j, the directed edge (vi,Vj) exists in G with 
probability p (independently) . Vertex Uj is infected if there is a (directed) path from v\ to Vi . The 
question we address is, "What is the size of the set of vertices reachable from ui?" (the size of the 
infection). We prove the following sharp result. 

Theorem 1. Let 1Z be the set of vertices reachable from v\, and suppose p = cl ° gn + £(n) ; where 
£(n) = o(-^p) and c > is fixed. Then: 

1. Ifc< I, then \n\ = n c+ °W, a.a.s. 
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2. If c= 1, then \R\ = o(n), a.a.s. 

3. Ifc> 1, then \K\ = (l - \ +o(l)) n, a.a.s. 

Recall that an event holds a.a.s. (asymptotically almost surely), if it holds with probability 
1 — o(l); that is it holds with probability tending to one as n tends to infinity. Note that we do 
not explicitly care whether is positive or negative in the results above. 

Similar phase transitions are well known for various graph properties in other random graph 
models. As shown by Erdos and Renyi in [2], in the G(n,M) model of random graphs, where a 
graph is chosen independently from all graphs with M edges, there is a similar emergence of a 
component of size O(n) around M = ^ edges. Likewise, a threshold for connectivity was shown for 
M = ral ° g " edges. For the more familiar G(n,p) model, where edges are present independenty with 
probability p, this translates into a threshold at p = - for a giant component, and at p = for 
connectivity. A much more comprehensive account of results on properties of random graphs can 
be found in pQ . Luczak in [4 J and more recently Luczak and Seierstad in [5] , studied the emergence 
of the giant component in a random directed graphs, in both the directed model where M random 
edges are present and in the model where edges are present with probability p. Thresholds for 
strong connectivity were established for random directed graphs by Palasti [7] (for random directed 
graphs with M edges) and Graham and Pike [3] (for random directed graphs with edge probability 
p) . We are not aware of any results for ordered directed random graphs where edges connect vertices 
of lower index to higher index. 



Upper bounds: For % > 1, let TZi denote the event that Vi is reachable, and let Xj denote the 
number of paths to vertex Vi in G. If Vi denotes the set of all potential paths from v\ to Uj, then 
Xi = ^2 xe p. I(x) where I(x) is a {0, 1} indicator random variable indicating whether the path x 
exists in G; I(x) = 1 if and only if all edges in the path x are present in G. Then, 



2 A Proof of Theorem Q] 



F(Hi) = F(Xi > 1) < ELY;] = E i [ ( x )] 
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Let X denote the number of reachable vertices (other than i?i). 
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+ £(n) with c < 1, 

e p(n+i) _-y = exp ( c iogn + o(logn)) — 1 = n' 
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EV =i+o(p). 



(eP - 1) Vfct fc! , 
Thus, 

E[X] < n c+ °«. 

Applying Markov's inequality yields that F(X > log(n)E[X]) = o(l), so X < log(n)E[X] = n c+ °W, 
a.a.s. 

Now consider c > 1. Let 

n n 2 £(n) 
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£'(n) := - max < 



log log n clogn 



- - 



Note that by our choice of £'(n), and the fact that £(n) = o(^p), that £'(n) = o(n). Then, 

P(^) < pe pt = pexp ((c^ + £(n)) - £'(n))) 

A , \ . <( n ) c(logn)£'(n) , \ 

= pexp I log(n) + — £(n)£ (n) 1 



i - , / x n£(n) c(logn)£'(n) . 
< (1 + o 1 )cexp loglog(n + - v & ^ — = o 1) 

en' 



Here, the last inequality comes from the fact that, by our choice of £'(n), 

C(1 ° g(n)) ^ /(n) - loglog(n) - > V (n) . 

n c 3 

Since pe p * is increasing in i, the expected number of reachable vertices Vi with i < t is at most 
tP(7£t) = o(n). Applying Markov's inequality, \1Z n {ui, . . . , = o(n) a.a.s. Thus, 

< n - t + \K n {wi, . . . , v t }\ = (1 - \ + o(l)) n a.a.s. 

For p = ^ + £(n) with f (n) = o , we will write f (n) = w(n)^, where w(n) -> 0. Let 

t = n.(l- W (n)- I ^).Then, 

P(^) < pe"* = exp [(1 + u>(n)) (l - w (n) - log n + log ((1 + w(n))^) 



exp 



-u;(n) 2 logn — (1 + oj(n))- — h loglogn + log(l + u(n)) 

log log n 



Thus the expected value of \1Z n {t>i, . . . ,vt}\ is o(n) and by Markov's inequality, this is also true 
a.a.s. Now, since n — t is also o(n), we have that R = o(n) a.a.s. □ 
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To prove the lower bounds, we require a simple lemma similar to Dirichlet's theorem. Let d(i) 
denote the number of divisors of i and let dt(i) denote the number of divisors of i that are at most t. 
Dirichlet's Theorem states that 

k 

E d(i) = k log k + (2 7 - l)k + 0(Vk), 

i=l 

where 7 is Euler's constant. For our purposes, we need a refinement of this result, summing dt{i). 



Lemma 1. E^tW = /clogmin(t, k) + 0(k). 



Proof. For t > k the result follows from Dirichlet's theorem as we may replace dt{i) with d(i) in 
the summation. For t < k, 



E d *W = k + 
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< kn t , 



where Tit is the t-th harmonic number. 



□ 



Lower bounds: For exposition, assume that we construct our graph on countably many vertices 
and that we then restrict our attention to the first n vertices. Let Xi denote the index of the i-th 
reachable vertex (that is not v\). If Xi > n then \R\ < i. Set Xq = 1, and for i > 1, Xi — Xi-\ is 
geometrically distributed with parameter 1 — (1 —p) 1 . Fix t, and consider E[JQ]: 

E[ Xt ] = ^nx k - x k _ ± ] = f: 1 _ ( l_ v)k - 

fc=l fe=l v F! 

Each term is an infinite geometric series, and so 

t 00 

E[X t ]=£X>-p)«. 

k=l j=0 

As this series is absolutely summable (as E[Xf] is clearly finite), Fubini's theorem allows us to 
rearrange terms in the summation to get 

t 00 00 
E[X t ] = * + ££(!- V? 3 = t + Y dt{i)(l ~ P)\ 



k=l j=l 



i=l 



because the term (1 — p) 1 appears in the original summation (where i = kj) once for every divisor i 
has that is at most t. We now use summation by parts to manipulate the second term: 



i=i 
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00 
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Since *(1 ~~ pT = Vi* > we have that 



log i 

Eli = i + — + O 
P 



(1) 



Furthermore, since — Xk and — Xk-i are independent, 

t 



Var(X t ) = £ 



fc\2 



t(l-(l"P) fc ) 



< 



(2) 



Here, the first inequality follows from an application of Cauchy-Schwarz, and the second from 

p 2 < p 2 _ i 
(i-(i- P ) fc ) 3 - p~z ~ p- 



Now, suppose that p = c-^p- + £(n) for c < 1, and set i = n c exp(— n|£(re)| — loglog(n)). Then, 
from eq:Et, 



E[X t ] <n c exp(-n|e(n)|) + 



clog n — 2n\^{n)\ — log log n 



< n c exp(— n|£(n)|) + n 



re -1 (clog n + n£(n)) 
ra 2 |£(re)| — log log n 
(clog n + n£(n)) 



+ o 



log n 
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log re 
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. , n|£(n)| + loglog(n) /n|£(n)| + log log n 
nil — r^—, ho 



(clog n + n^{n)) 
For n sufficiently large, E[X(] < n 1 1 



rt|g(rQ|+loglog71 

2c log n 



Var(X t ) < (1 + o(l)) 



(l + (l))_5_.E[X t ] " 



log n 

. Meanwhile, from @, 
5( 1+c ) MX*} 
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(4) 
(5) 



log n c log n log n 

because E[X<] = 0(n) and c < 1. Chebyshev's inequality asserts that 
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n 



3/2 



log re 



\X t -K[X t ]\ > 



n 2 |£(n)| + n log log n 



2c log n 



< 



4c 2 log 2 n • Ysi(X t ) 
(n 2 |£(n)| + n (log log n)) 2 



o(l). 



Thus, 



X t < E[X t ] + nloglog " 



2c log 71 



X* < n 1 



1 — o(l). Using eq:Etl, 
log log n ( log log n 



2c log n 



+ o 



clog n 



l-o(l), 



i.e., < n a.a.s. Since X 4 < n implies |7£| > t, we have that \1Z\ > n c exp(— n|£(n)| — loglog(n)) 
n c+ °W a.a.s. 

For c > 1, take t = nl ™ n . Then, using eq:Et, 



log n 



n 



E[X t ] <- + o(n). 
c 
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Again, by ([2]) and because E[X t ] = 0(n), Var(X t ) = 0(n 3 /Vloglogn/logn) = o(n 3 / 2 ). Chebyschev's 
inequality asserts that 

'\X t -E[X,]\ >n 3/4 



3/2 



Hence, 



X< < E[X t ] + n 



3/4 



l-o(l). 



(6) 



So, -Xt < ^ + o(n) a.a.s. We now consider the vertices indexed higher than Xt and show that 
essentially all of them are reachable. Let Y be the vertices with index higher than Xt which are 
not adjacent to one of the first t reachable vertices in vi, . . . , vx t - Then 



E[|y| 



j=Xt+l 



(n 



X t )(l - p) 1 < ne~ 



P t 



n 



on 



n 



Applying Markov's inequality, \Y\ = o(n) with probability 1 — o(l). Since the set of vertices indexed 
above Xt that is not reachable is a subset of Y, \1Z\ >t+(n — Xt) — \Y\. Since \Y\,t are o(n) and 
Xt = - + o(n), we have that \1Z\ > n(l — - + o(l)) with probability 1 — o(l), as desired. □ 
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